Learning Low Dimensional Invariant Signature of 3-D Object under Varying View and Illumination from 2-D Appearances

نویسندگان

  • Stan Z. Li
  • Jie Yan
  • XinWen Hou
  • ZeYu Li
  • HongJiang Zhang
چکیده

In this paper, we propose an invariant signature representation for appearances of 3-D object under varying view and illumination, and a method for learning the signature from multi-view appearance examples. The signature, a nonlinear feature, provides a good basis for 3-D object detection and pose estimation due to its following properties: (1) Its location in the signature feature space is a simple function of the view and is insensitive or invariant to illumination. (2) It changes continuously as the view changes, so that the object appearances at all possible views should constitute a known simple curve segment (manifold) in the feature space. (3) The coordinates of the object appearances in the feature space are correlated in a known way according to a predefined function of the view. The first two properties provide a basis for object detection and the third for view (pose) estimation. To compute the signature representation from input, we present a nonlinear regression method for learning a nonlinear mapping from the input (e.g. image) space to the feature space. The ideas of the signature representation and the learning method are illustrated with experimental results for the object of human face. It is shown that the face object can be effectively modeled compactly in a 10-D nonlinear feature space. The 10-D signature presents excellent insensitivity to changes in illumination for any view. The correlation of the signature coordinates is well determined by the predefined parametric function. Applications of the proposed method in face detection and pose estimation are demonstrated.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Image Detection Under Varying Illumination and Pose

This paper focuses on the detection of objects with Lambertian surface under both varying dlumination and pose We offer to apply a novel detection method that proceeds by modeling the d@erent illuminations from a small number of images in the training set, this automatically voids the illumination effects, allowing fast dlumination invariant detection, without having to create a large training ...

متن کامل

View-Based Clustering of Object Appearances Based on Independent Subspace Analysis

In 3D object detection and recognition, an object of interest is subject to changes in view as well as in illumination and shape. For image classification purpose, it is desirable to derive a representation in which intrinsic characteristics of the object are captured in a low dimensional space while effects due to artifacts are reduced. In this paper, we propose a method for view-based unsuper...

متن کامل

Efficient detection under varying illumination conditions and image plane rotations q

This paper focuses on the detection of objects with a Lambertian surface under varying illumination and pose. We offer to apply a novel detection method that proceeds by modeling the different illuminations from a small number of images in a training set; this automatically voids the illumination effects, allowing fast illumination invariant detection, without having to create a large training ...

متن کامل

Efficient detection under varying illumination conditions and image plane rotations

This paper focuses on the detection of objects with a Lambertian surface under varying illumination and pose. We offer to apply a novel detection method that proceeds by modeling the different illuminations from a small number of images in a training set; this automatically voids the illumination effects, allowing fast illumination invariant detection, without having to create a large training ...

متن کامل

Fast Learning VIEWNET Architectures for Recognizing 3-D Objects from Multiple 2-D Views

The recognition of 3-D objects from sequences of their 2-D views is modeled by a family of self-organizing neural architectures, called VIEWNET, that use View Information Encoded With NETworks. VIEWNET incorporates a preprocessor that generates a compressed but 2-D invariant representation of an image, a supervised incremental learning system that classifies the preprocessed representations int...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001